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void callPage(String uri. HttpServIetRequest request) 



servlet = find servlet; 
HandleFragmentInfo(url. request, servlet); 
servlet.service(request, response); 



} 

void handleFragmentInfo(String urI, 

HttpServIetRequest request, 
HttpServlet servlet) 

try I 

FIG. 8 id = servlet.getCacheldQ; 

! catch (NoSuchMethodException e) | 
return; 

if (id == null) I 
return; 

\ 

fragnaentlnfo = new Fragnnentlnfo(); 
fragnaentlnfo.setld(id); 
Fragment[nfo.setUri(url); 
request.setFragmentlnfo(fragmentlnfo); 
return; 
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<jsp:directive.taglib uri="/xsp/tagbean/cache/taglib.xmr' prefix="cache"/> 
<cache:enable> 

<cachelt>!categoryld.equols{"produce")</cachelt> 

<id>"productOetail-'' + productId</id> 

<pinned>false</pinned> 

<aggressiveRerender>false</aggressiveRerender> 
<extemalCacheGroupId>*'group1"</externalCacheGroupId> 
<dataId>"product-" + productId</datald> 
</cache:enable> 

The equivalent JSP syntax without specialized tags would be: 
<%! public String 

^etCacheId(conf>.ibm.servlet.engine.SEHttpServiceRequest request) 

return ''productDetoil-*' f productid; 

%> 



FIG. 9 



<% Fragmentlnfo fragnnentlnfo = request.getFragmentlnfo(); 

frogmentlnfo.setCacheIt(!cotegoryId.equals("praduce")); 

fragnnentlnfo.setPin(false); 

fragmentlnfo.setAggressiveRerender(false); 

frogmentInfo.setExtemalCacheGroupId("groupr'); 
Vector vector = new VectorQ; 
vector.addElement("product-" + productid); 
fragnnentlnfo.setDatalds(vector); 



FIG. 10 
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FIG. 11 



<?xml version="1.0"?> 
<jsp:root> 

<jsp:directive.pQge errorpage="/xsp/test/errorPage.xsp"/> 
<jsp:directive.taglib uri='yxsp/tagbean/cache/cache-taglib.xfnr' 
prefix="cache'7> 

<%! 

public String 

^etCacheId(comJbm.servlet.engine.SEHttpServiceRequest request) 
return + "/xsp/test/cache/publishl/pagel.xsp"; 

%> ^ 
<% 

Vector vector = new Vector(); 

vector.addElement("product-2066-4"); 

vectoroddElement("shopperGrcup-2066-2"); 



Fragnaentlnfo frogmentlnfo = request.peekFragnnentInfo(); 
fragnnentInfo.setTimeLimit(-1 ); 
1 00 fragnnentlnfo.setPin(false); 

fragmentlnfo.setAggressiveRerender(fGlse); 

fragment! nfo.setExternalCocheGroupld("group1"); 
fragment]nfo.setOataIds(vector); 

%y 

<html> 
<head> 

<title>EnQble JSP caching</title> 
</head> 
<body> 

<% 

System.out.println("out = '* + out); 
%> 

Page 1 calling page 2<br/> 

<jsp:request include='y xsp/test/ cache/ publish 1 / poge2.xsp'7 > 
Page 1 returning from page 2<br/> 
</body> 
</html> 
</jsp:root> 

<?xml version="1.0"?> 
<jsp:rool> 

<jsp:directive.taglib uri="/xsp/tagbean/coche/cache-taglib.xml" 
prefix="cache'7> 

<%! 

public String 

ptCocheld(cGm.ibm.servlet.engine.SEHttpServiceRequest request) 
^ retum +"/xsp/test/cache/publishl/page2.xs"; 

%> 

<b>ln page 2</b><br/> 
</jsp:root> 
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METHOD AND APPARATTJS FOR 
MANAGING INTERNAL CACHES AND 
EXTERNAL CACHES IN A DATA 
PROCESSING SYSTEM 

CROSS REFERENCE TO RELATED 
APPUCAnONS 

This application is a divisional of application Ser. No. 
09/359,275, filed Jul. 22, 1999, now U.S. Pat. No. 6,507, 
891. 

The present invention is related to applications entitled 
METHOD AND APPARATUS FOR HIGH- 
CONCURRENCY CUENT LOCKING WITH JAVA IN A 
DATA PROCESSING SYSTEM, Ser. No. 09/359,274, 
METHOD AND APPARATUS FOR CACHE COORDINA- 
TION FOR MULTIPLE ADDRESS SPACES, Ser. No. 
09/359,276, METHOD AND APPARATUS FOR INVALI- 
DATING DATA IN A CACHE, Ser. No. 09/359,277, 
METHOD AND APPARATUS FOR AGGRESSIVELY 
RENDERING DATA IN A DATA PROCESSING 
SYSTEM, Ser. No. 09/359^78, and A METHOD AND 
APPARATUS FOR CACHING CONTENT IN A DATA 
PROCESSING SYSTEM WITH FRAGMENT 
GRANULARITY, Ser. No. 09/359,279, all of which are filed 
even date hereof, assigned to the same assignee, and incor- 
porated herein by reference. 

BACKGROUND OF THE INVENTION 

1. Technical Field 

The present invention relates generally to an improved 
data processing system and in particular to a method and 
apparatus for caching data in a data processing system. More 
specifically, the present invention relates to a method and 
apparatus for caching documents containing dynamic con- 
tent. 

2. Description of Related Art 

The Internet, also referred to as an "internetwork'*, is a set 
of computer networks, possibly dissimilar, joined together 
by means of gateways that handle data transfer and the 
conversion of messages from the sending network to the 
protocols used by the receiving network (with packets if 
necessary). When capitalized, the term "Internet" refers to 
the collection of networks and gateways that use the TCP/IP 
suite of protocols. 

The Internet has become a cultural fixture as a source of 
both information and entertainment. Many businesses are 
creating Internet sites as an integral part of their marketing 
efforts, informing consumers of the products or services 
offered by the business or providing other information 
seeking to engender brand loyalty. Many federal, state, and 
local government agencies are also employing Internet sites 
for informational purposes, particularly agencies which 
must interact with virtually all segments of society such as 
the Internal Revenue Service and secretaries of state. Pro- 
viding informational gtiides and/or searchable databases of 
online public records may reduce operating costs. Further, 
the Internet is becoming increasingly popular as a medium 
for commercial transactions. 

Currently, the most commonly employed method of trans- 
ferring data over the Internet is to employ the World Wide 
Web environment, also caUed simply "the Web". Other 
Internet resources exist for transferring information, such as 
File Transfer Protocol (FTP) and Gopher, but have not 
achieved the popularity of the Web. In the Web environment, 
servers and cHenls effect data transaction using the Hyper- 
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text Transfer Protocol (HTTP), a known protocol for han- 
dling the transfer of various data files (e.g., text, still graphic 
images, audio, motion video, etc.). The information in 
various data files is formatted for presentation to a user by 

5 a standard page description language, the Hypertext Markup 
Language (HTML). In addition to basic presentation 
formatting, HTML allows developers to specify "links" to 
other Web resources identified by a Uniform Resource 
Locator (URL). A URL is a special syntax identifier defining 

10 a communications path to specific information. Each logical 
block of information accessible to a client, called a "page" 
or a "Web page", is identified by a URL. The URL provides 
a \miversal, consistent method for finding and accessing this 
information, not necessarily for the user, but mostly for the 

15 user^s Web "browser". A browser is a program capable of 
submitting a request for information identified by a URL at 
the client machine. Retrieval of information on the Web is 
generally accomplished with an HTML-compatible browser. 
Web content is often dynamic because of various changes 

20 made by developers and other users publishing or making 
available web content, such as Web pages. Even static pages 
are occasionally updated. Web servers provide static content 
and dynamic content to various users. Static content contain 
data from files stored at a server. Dynamic content is 

25 constructed by programs executing at the time a request is 
made. The presence of dynamic content often slows down 
Web sites considerably. High-performance Web servers can 
typically deliver several hundred static pages per second. By 
contrast, the rate at which dynamic pages are delivered is 

30 often one or two order of magnitudes slower. 

Dynamic content is often present at a web site in an effort 
to provide customized pages and updated information to 
various users that may visit the site. The use of this type of 
Web page, however, may cause a web site to slow down in 
performance. 

Proxy caches are used to store data at sites that are remote 
from the server which originally provided the data. Proxy 
caches reduce network traffic and latency for obtaining Web 
data because clients can obtain the data from a local proxy 
^° cache instead of having to request the data directly from the 
site providing the data. This mechanism, however, does not 
work well with dynamic pages. One problem presented by 
dynamic pages cached in proxy servers is that it is essential 
for the cache pages to be current at all limes. 

Therefore, it would be advantageous to have an improved 
or alternative mechanism for caching and handling dynamic 
content. 

SUMMARY OF THE INVENTION 
^0 The present invention provides a method and apparatus in 
a data processing system for caching data in an internal 
cache and in an external cache A set of fragments is received 
for caching. A location is identified to store each fragment 
within the pltirality of fragments based on a rate of change 
of data in each fragment. The set of fragments is stored in the 
internal cache and the external cache using the location 
identified for each fragment within the plurality of frag- 
ments. 

go BRIEF DESCRIPTION OF THE DRAWINGS 

The novel features believed characteristic of the invention 
are set forth in the appended claims. The invention itself, 
however, as well as a preferred mode of use, further objec- 
tives and advantages thereof, will best be understood by 

65 reference to the following detailed description of an iUtis- 
trative embodiment when read in conjimction with the 
accompanying drawings, wherein: 
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FIG. 1 depicts a pictorial representation of a distributed connected together within distributed data processing sys- 

data processing system in which the present invention may tem 100. Network 102 may include permanent connectioos, 

be implemented; such as wire or fiber optic cables, or temporary connections 

FIG. 2 is a block diagram of a server system depicted in ^^^^ through telephone connections, 

accordance with a preferred embodiment of the present S depicted example, a server 104 is connected to 

invention; network 102 along with storage imit 106. In addition, clients 

T^r- 1- 1-1 1 J- J • J . • 108, 110, and 112 also are connected to a network 102. 

nG. 3 IS a block diagram depicUng a data processing ^^^^^ ^^g^ ^2 may be, for example, 

system that may be implemented as a server m accordance personal computers or network computers. For purposes of 

with a preferred embodiment of the present mvention; this appUcaUon, a network computer is any computer, 

FIG. 4 is a block diagram illustrating a data processing coupled to a network, which receives a program or other 

system in which the present invention may be implemented; application from another computer coupled to the network. 

nG. 5 is a diagram illustrating examples of different ^° depicted example, server 104 provides data, such as 

update rates and caching for pages depicted in accordance ^^^^ operadng system images, and appUcations to 

with a preferred embodiment of the present invention; .5 cUents 108-112. Clients 108, 110, and 112 are cUents to 

, . ^ , . server 104. Distnbuted data processmg system 100 includes 

FIG. 6 IS a dia^am illustraUng page fragments depicted ^ ^^^^ ^^^^^^ ^^^^ p^o^^^ ^^t^ ^^^^^ 

m accordance with a preferred embodiment of the present 108-U2. Server system 114 may take various forms. For 

mvention; example, server system 114 may consist of two or more 

FIG. 7 is a diagram of metadata used to cache a fragment servers that have been logically associated with each other 

depicted in accordance with a preferred embodiment of the 20 or may actually be interconnected as a cluster. Distributed 

present invention; data processing system 100 may include additional servers, 

FIG. 8 is a diagram illustrating statements for use in a ^'^^ ^^^^^^ 

RequestDispatcher.include( ) method depicted in accor- depicted example, distributed data processmg sys- 

dance with a preferred embodiment of the present invention; ^'"^ J^,^ f ^^^J^ representing a 
„ . .„ . . : , ,.25 worldwide collection of networks and gateways that use the 

FIG. 9 is a diagram illustratmg mstructions used to add a jcP/IP suite of protocols to communicate with one another, 

get cache method to a JSP depicted in accordance with a ^t the heart of the Internet is a backbone of high-speed data 

preferred embodiment of the present invention; communication lines between major nodes or host 

FIG. 10 is a diagram of instructions used in a service computers, consisting of thousands of commercial, 

method depicted in accordance with a preferred embodiment 30 government, educational and other computer systems that 

of the present invention; route data and messages. Of course, distributed data pro- 

FIG. 11 is an example of a JSP depicted in accordance cessing system 100 also may be implemented as a number 

with a preferred embodiment of the present invention; of different types of networks, such as, for example, an 

HG. 12 is a data flow diagram iUustrating the handling of a local area network (LAN), or a wide area network 

an external request from a client for content depicted in 35 (WAN). HG. 1 is intended as an example, and not as an 

accordance with a preferred embodiment of the present architectural limitation for the present invention, 

invention; With reference now to FIG. 2, a block diagram of a server 

HG. 13 is a data flow diagram showing handling of a data ^y^'^^^ ^^^icted in accordance with a preferred embodi- 

ID or fragment ID invalidation depicted in accordance with °^ P'««°' invention. Server system 200 is an 

a preferred embodiment of the present invenUon; «"^Pl« °^ » ffr^iy?'*,"'^"'''' '^''^ ^ 

r-T^ ' . a J- -11 . . server system 114 m FIG. 1. Server system 200 illustrates a 

FIG. 14 IS data flow diagram ilh^raUng aggressive rer- ^..^hanism in which a web application achieves throughput 

endenng depicted m accordance with a preferred embodi- , ju- u -i u-i t i * 

, . . scale up and high availabihty. In this example, server system 

ment of the present mvenUon; ^00 contains \^b server nodes 202 and 204, which execute 

FIG. 15 IS a data flow diagram illustrating data flow for ^eb application servers 206 and 208 respectively. The use 

handhng an extemal request using multiple JVMs depicted the term "server^* may refer to either the physical data 

in accordance with a preferred embodiment of the present processing system or to an application providing receiving 

mvention; processing requests from a cUent. Similarly, the client 

FIG. 16 is a data flow diagram illustrating handling a data may be a physical data processing system or an apphcation. 

ID or fragment ID invalidation, as well as handling time 50 For example, a client apphcation would be a Web browser, 

hmit expiration for a multi JVM system depicted in accor- Server system 200 also includes a sprayer 210 which is 

dance with a preferred embodiment of the present invention; connect to Web server nodes 202 and 204. Sprayer 210 is a 

aiid TCP/IP sprayer and acts as a cache that is external to Web 

FIG. 17 is a data, flow diagram flow for handling aggres- server nodes 202 and 204. A sprayer performs the function 

sive re rendering using multiple JVMs depicted in accor- ss of spreading requests to multiple webserver/application 

dance with a preferred embodiment of the present invention. server nodes. Not aU sprayers have the proxy function. A 

ncTAif cri nccr-DTimr^KT r^c Tur: Sprayer can also perform the function of a proxy server. The 

DETAILED DESCRIPTION OF THE sprayer is closer to the cUent than the apphcation server and 

PREFERRED EMBODIMENT j^^y contain its own cache. Sprayer 210 is used to 

With reference now to the figures, FIG, 1 depicts a 60 distribute HTTP requests evenly among the web servers, 

pictorial representation of a distributed data processing such as Web servers located in Web server nodes 202 and 

system in which the present invention may be implemented. 204. Sprayer 210 typically uses a time out to detect when 

Distributed data processing system 100 is a network of one of the Web server nodes can no longer respond, so it can 

computers in which the present invention may be imple- be taken off the sprayer's list of available nodes and a 

mented. Distributed data processing system 100 contains a 65 notification sent to an administrator, 

network 102, which is the medium used to provide commu- Web server nodes 202 and 204 typically coordinate via 

nications links between various devices and computers shared data (e.g., a distributed file system or database 
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system). These data mechanisms can themselves scaleup via local bus 406 may be made through direct component 

their own foraa of parallelism. In the depicted example, this interconnection or through add-in boards. In the depicted 

sharing of data is accomplished through shared parallel example, local area network (LAN) adapter 410, SCSI host 

database 212, which contains an interconnect 214 providing bus adapter 412, and expansion bus interface 414 are con- 

a connection between storage devices 216-222. Tliese stor- 5 nected to PCI local bus 406 by direct component connection, 

age devices form a cluster and are shared by both Web server contrast, audio adapter 416, graphics adapter 418, and 

node 202 and Web server node 204, These storage devices audioA^ideo adapter 419 are connected to PCI local bus 406 

contain relational databases from which content is pulled by ^^^'^^ ^"^^^^^ inserted into expansion slots. Expansion 

Web server nodes 202 and 204 to dynamically create pages. interface 414 provid^ a connecUon for a keyboard and 

In this example, these pages are distributed to sprayer 210 lO ^.Tcrif^^^' k ' T "^Jll^' and additional memory 

for further distribution to cUents. f^, host bus adapter 412 provides a con^^^^^^ 

_ , . . J ^ . , . hard disk dnvc 426, tape drive 428, and CD-ROM dnvc 430. 

The depicted conflguxation for server system 200 is j -^^ bus implementations wiU support three or 

mtended as an example and is not meant to miply architec- expansion slots or add-in connectors, 

tural Imiitations with respect to the present invention. . .. . , . . , 

„ , . r~ ^ operatmg system runs on processor 402 and is used to 

Referring to FIG. 3, a block diagram depicts a data 15 ^^^^^^^^ p^^^.j^ ^^^^^j „f ^^^^^ components 

processing system that may be implemented as a server, such ^thin data processing system 400 in FIG. 4. The operating 
as server 104 m FIG. 1, Web server node 202, or Web server ^y^^^ ^ commercially available operating system 
node 204, m accordance with a preferred embodiment of the 5^^^ oS/2, which is available from International Business 
present mvenljon. Data processing system 300 may be a Machines Coiporation. "OS/2" is a trademark of Intema- 
symmetric multiprocessor (SMP) system including a plura^^ 20 ^.^^^^ ^^^^ Machines CorporaUon. An object oriented 
ity ot processors 302 and 304 connected to system bus 306. proerammine system such as Java may run in conjunction 
A^temauvely, a smgle processor system may be employed. ^^^^^^^ ^y^j^^ ^^^^-^^ ^ ^^^^^^^ 
Also connected to system bus 306 is memory controller/ ^y^^^^ ^qj^ j^^a programs or applications executing on 
cache 308 which provides an interface to local memory 309 ^^^^ processing system 400. "Java" is a trademark of Sun 
I/O bus bridge 310 is connected to system bus 306 and ^ Microsystems, Inc. Instructions for the operating system, the 
provides an interface to I/O bus 312. Memory controller/ object-oriented operating system, and applications or pro- 
cache 308 and I/O bus bridge 310 may be mtegrated as grams are located on storage devices, such as hard disk drive 
depicted. 426, and may be loaded into main memory 404 for execution 

Peripheral component interconnect (PCI) bus bridge 314 by processor 402. 

connected to I/O bus 312 provides an interface to PCI local xhose of ordinary skiU in the art wiU appreciate that the 

bus 316. A number of modems may be connected to PCI bus hardware in FIG. 4 may vary depending on the implemen- 

316. Typical PCI bus implementations will support four PCI t^tion. Other internal hardware or peripheral devices, such as 

expansion slots or add-m connectors. Commumcations links flash ROM (or equivalent nonvolatUe memory) or optical 

to network computers 108-112 in FIG. 1 may be provided disk drives and the like, may be used in addition to or in 

through modem 318 and network adapter 220 connected to place of the hardware depicted in FIG. 4. Also, the processes 

PCI local bus 316 through add-in boards. of t^e present invention may be appHed to a multiprocessor 

Additional PCI bus bridges 322 and 324 provide inter- data processing system, 

faces for additional PCI buses 326 and 328, from which por example, data processing system 400, if optionally 

additional modems or network adapters may be supported. configured as a network computer, may not include SCSI 

In this manner, server 300 allows connections to multiple host bus adapter 412, hard disk drive 426, tape drive 428, 

network computers. A memory-mapped graphics adapter and CD-ROM 430, as noted by dotted line 432 in FIG. 4 

330 and hard disk 332 may also be connected to I/O bus 312 denoting optional inclusion. In that case, the computer, to be 

as depicted, either direcdy or indirectly. properly called a client computer, must include some type of 

Those of ordinary skill in the art will appreciate that the 45 network communication interface, such as LAN adapter 
hardware depicted in FIG. 3 may vary. For example, other 410, modem 422, or the like. As another example, data 
peripheral devices, such as optical disk drives and the like, processing system 400 may be a stand-alone system con- 
also may be used in addition to or in place of the hardware figured to be bootable without relying on some type of 
depicted. The depicted example is not meant to imply network communication interface, whether or not data pro- 
architcctural limitations with respect to the present inven- 5Q ccssing system 400 comprises some type of network com- 
mon, munication interface. As a further example, data processing 

The data processing system depicted in FIG. 3 may be, for system 400 may be a Personal Digital Assistant (PDA) 

example, an IBM RISC/System 6000 system, a product of device which is configured with ROM and/or flash ROM in 

International Business Machines Corporation in Armonk, order to provide non-volatile memory for storing operating 

N.Y, running the Advanced Interactive Executive (AIX) ss system files and/or user-generated data, 

operating system. The depicted example in FIG. 4 and above-described 

With reference now to FIG. 4, a block diagram illustrates examples are not meant to imply architectural limitations, 

a data processing system in which the present invention may For example, data processing system 400 also may be a 

be implemented. Data processing system 400 is an example notebook computer or hand held computer in addition to 

of a client computer. Data processing system 400 employs a 60 taking the form of a PDA. Data processing system 400 also 

peripheral component interconnect (PCI) local bus architec- niay be a kiosk or a Web appliance, 

ture. Although the depicted example employs a PCI bus. The present invention provides a method, apparatus, and 

other bus architectures such as Micro Channel and ISA may instructions for caching dynamic content. The mechanism of 

be used. Processor 402 and main memory 404 are connected the present invention is especially useful in caching Web 

to PCI local bus 406 through PCI bridge 408, PCI bridge 408 65 content, such as Web pages. In particular, the mechanism of 

also may include an integrated memory controller and cache the present invention may be used to provide caching for 

memory for processor 402. Additional connections to PCI Web pages containing data having different rates of change. 
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Ttoing to FIG. 5, a diagram illustrating examples of 
different update rates and caching for pages is depicted in 
accordance with a preferred embodiment of the present 
invention. FIG. 5 shows update rates for various types of 
pages as well as the type of caching that may be used. 5 

Generally, caching is more beneficial as the access rate 
increases and as the update rate decreases. All Web content 
is somewhat dynamic because everything changes 
eventually, even static pages. For example, if content 
changes very infrequently, then it is convenient for a human lo 
to republish the Web site whenever its content dianges. An 
example of this type of content is a typical home page. It is 
usually safe to enable browser and proxy server caching. 

If content changes so often that it is imreasonable to 
republish the Web site every time it changes, then a template, 
such as a Java Server Page (JSP), may be used to dynami- 
cally get the content from a file or database, and then render 
(i.e., format) the content into a HTML page. An ad service 
is an example of content that may change each time a page 
is requested. A JSP and a servlet are both mechanisms, ^ 
which use Java standards for programming dynamic content. 
A JSP is aimed at Web application designers and servlets are 
aimed at programmers. A JSP is compiled into a servlet for 
execution. In this case, static caching in browsers and proxy 
servers may be disabled, and dynamic caching may or may 
not be useful. 

If the content is constant over a large number of requests, 
then performance can be significantly improved by using 
dynamic caching. Examples of content that is fairly consis- 
tent are products in e-commerce and white pages as shown 
in FIG. 5. W^th dynamic caching, either time limit or a data 
ID iovalidation mechanism can be used to keep the content 
in the cache up to date. One way to view this case is that it 
automates the pubhshing process so that high update rates 
can be handled. 

If the content changes continuously, such as, for example, 
a ticker tape, any form of caching is a bad idea because 
caching overhead is suffered with no benefit. JSPs may be 
used to generate a page containing this content without any 
caching. 

Caching rendered HTML content offers performance 
improvements when the underlying dynamic content has not 
changed. One performance improvement is avoiding access 
to backend servers to get the dynamic content. The access 45 
avoided to backhand servers includes, for example, access- 
ing a database, a transaction monitor, an internal application, 
or a news service. Additionally, rendering the dynamic 
content into HTML is avoided. 

Caching the underlying dynamic content (i.e., data) 50 
instead of HT^L requires rendering the data into HTML 
during the fast path (i.e., when there is a cache hit). 
However, the fast path occurs more often. When the same 
data is rendered in multiple ways, caching rendered HTML 
requires accessing the backend server once for each render- 55 
ing. The tradeoff is difficult to make because it is between 
avoiding a typically less expensive activity (i.e., 
rerendering) vs. infi-equently avoiding a typically more 
expensive activity (i.e., backend access). The processes of 
the present invention may provide advantages for caching of 60 
data and in particular for caching rendered pages, such as 
those in HTML format. Rendering occurs when a server 
template (eg, a JSP) is executed and it returns a format which 
can be read by its requester. Reader is an agent of the 
requester which can display the rendering to its client 65 
(another person or program). Readers would include HTML 
browsers. Acrobat pdf viewers, or Inlemet agents. A frag- 
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ment is rendered the first time by an external HTTP request 
asking for contents of a servlet/JSP/ASP/cgi program to 
execute. Rerendering is the process of reusing the stored 
request object to re-render the same fragment by 
re-executing the servlet/JSP. 

The change rate or update rate also determines whether 
dynamic content (via a template) shoixld be used instead of 
static content. Static content reqiiires redoing the pages and 
republishing the site wherever something changes. A 
dynamic content template goes to the database or file system 
to read the latest data, formats it into HTML (called 
rendering), and reUims it to the client. For sites that have 
something that changes often, dynamic content templates 
make the site manageable. 

With reference now to FIG. 6, a diagram illustrating page 
fragments is depicted in accordance with a preferred 
embodiment of the present invention. One mechanism pro- 
vided by the present invention to provide improved caching 
of content is to define content in a page as one or more 
fragments and cache the fragments individually, rather than 
as a single page. 

A firagment is a part or all of a rendered HTML page 
which can be cached. A fragment can contain 0 or more child 
fragments, and can be contained by 0 or more parent 
fragments, forming a directed acyclic graph (DAG). FIG. 6 
illustrates a returned page 600, which is a product display 
page. Page 600 is a "top-level" fragment made up of 5 child 
fragments. Page 600 includes a product gif URL fragment 
602, a product detail fragment 604, a personalized greeting 
fragment 606, a shopping cart fragment 608, and an ad 
service fragment 610. The fragments depicted in FIG. 6 are 
shown in order of increasing rate of change for the under- 
lying content in the fragment with product gif URL 602 
having the slowest rate of change and ad service fragment 
610 having the fasted rate of change. Product gif URL 
fragment 602 contains a hypertext reference (HREF) that 
specifies the URL of the file for an image that shows what 
the product looks like. 

Product detail fragment 604 in this example may be a 
formatted table that includes the detailed description of the 
product with details, such as the product order number, 
name, options, and price. Personahzed greeting fragment 
606 is a fragment including a greeting that is directed 
towards the user, such as, for example, "Hello, John! Wel- 
come to Acme Corp.". Shopping cart fragment 608 in this 
example is a formatted shopping cart, including the order 
number, name, quantity and price of the products that have 
been chosen for possible purchase. 

Ad service fragment 610 includes a HREF for an image 
that displays an advertisement. The advertisement href is 
different each time a page is sent to a shopper. This makes 
page 600 as a whole too volatile to cache. However, frag- 
ment granularity allows the other portions of page 600 to be 
cached. 

The HREF to the product image in product gif URL 
fragment 602 and the detailed product description in product 
detail table fragment 604 are excellent candidates for frag- 
ments to be cached because the underlying data of a par- 
ticular product changes infrequently. However, the underly- 
ing data of some product changes far too frequently for static 
pubhshing. 

The personalized greeting in personalized greeting frag- 
ment 606 has the lifetime of a user session, but only for a 
particular shopper. It may be used several times within a 
fairly short time interval. Thus, personalized greeting frag- 
ment 606 is a good candidate for dynamic caching. Shop- 
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ping cart fragment 608 changes multiple times within a user to null and fragment ID 702 to be put on a queue to be 

session (every time something is added or the quantity rerendered when idle time pennits. When this option is 

changes), so it is not as good a candidate for dynamic turned off, a time limit or data ID invalidation causes 

caching as the personalized greeting. Lf, however, shopping fragment 700 to be removed from the cache. It will be 

cart fragment 608 is included on every page returned to the 5 rerendered and put back in the cache only when needed by 

shopper, then shopping cart fragment 608 is typically an external HTTP request. 

returned several times between changes, so there is a rea- Aggressive rerendering can exploit the idle time of the 

sonable case for caching it. The advertisement HREF in ad server by rendering pages in a backgrouod mode. The use of 

service fragment 610 is a poor candidate for caching because this option is simUar to automating the pubhshing process, 

the hit ratio would be zero and caching has iU own overhead 10 ^^^^ may be used to set data IDs that represent the 

(i.e storing it m the cache and invalidatmg it). Of course, underlying data which cause the fragment to be invalidated, 

each child fragment may contam additional fragments, Fr^^gmcni 700 can use zero or more pieces of data, so that 

T\immg next to FIG. 7 a diagram of metadata used to fragment700canhavezeroormoredataIDs. Apiece of data 

cache a fragment is depicted in accordance with a preferred can be used in one or more fragments, so a data ID can have 

embodiment of the present invention. Fragment 700 is 15 ^^^^ fragments. When the raw data changes, then its 

stored in a cache, which is referred to in these examples as data ID is used to invahdate all the fragments that depend on 

a "fragment cache". In FIG. 7, fragment 700 includes a it. It must be unique within the same scope as the fragment 

fragment ID 702, which identifies fragment 700 within a id. When a piece of data is used in only one fragment, the 

fragment cache. Fragment ID 702 is selected to be unique data ID can be the same as the fragmeat ID. 

within a fragment cache. This uniqueness may be with 20 depicted example, the fragment ID will always be 

respect to a process withm a server (if each server process of the data IDs. When a piece of data is used in multiple 

has its own fragment cache instance), a server (if a fragment fragments, its data ID would be different from either of the 

cache instance is shared across the processes within a fragment IDs. A database trigger might be used to create the 

server), or a cluster of servers (if a fragment cache instance external event that initiates the invalidation, 

is shared across multiple nodes in a c^ 25 ^^^^ ^ ^^^^^^ ^^.^ ^^^^ 

such as fragment ID 702, is typicaUy computed using „,etadata that may be used to set a maximum time hmit that 

processes on a JSP template. Fragment ID 702 can be the ^ . -nn u u u u j -^i. * •* i u ■ 

,Tw>x . TTT^r . .. 1 fragment 700 should be cached without its value being 

URL the URL pliis some request attnbutes, or not directly ^^^^ ^ granularity is configurable. For the web 

related to the URL. j , .■ • . ■ , ..• j . u .1, 

developer, this is sunpler than setting data IDs because there 

Fragment ID 702 can be used to enforce security by „e no triggers to write. However, it causes many fragments 

adding user information. The above example of a product to be discarded and rerendered when they are actually still 

description fragment does not enforce security. It uses an ID valid. A negative value implies that no time hmit exists. 

that includes the fact that it is a product detail fragment and ^^^^^ ^^j^^^^ ^^^^ ^.^^j^ ^„ ^ 

Ifn/Z^ls^', ".""^ P^°''^«=>'5«»fl- 35 group of external caches that are written to when a page- 

TA23434 ). However, if shopper groups are supported that j^^^j ^ j ^^^^^^^ „^ invalidated. A page level 

have different pnces and/or a different subset of the j ^ invalidated when the page fragment is invali- 

products, then the ID might also mclude the shopper group dated 

(e.g., "productDetail— SG188— TA23434"). Placing this ^ ' ,™ , , ■ , -rio u- u • *u u a 

jj-*- ^ ' c ' c * Tr» * u Fragment 700 also contams value 718 which IS the cached 

additional information m fragment ID 702 prevents a shop- , ^ r r . r . . • . -u 

*t. * • * ■ *u L c • t u • *u 40 rendering of the fragment. For a fragment that contains child 

per that is not in the shopper group from sunply changing the . , ^ . , ^ • .1 * - 

URL to obtain or acce^ cadied information, provided the ^^^me^te, the fri^eat can be stored m the cac*e ,n either 

cache is accessed within the application after authentication. ° ^ ° owing onus. 

Fragment 700 also contains a URL 704, which is the URL ^^^^ ^^^^^ ^^^^"^^"^ 

relative to the server for this fragment. For a top-level Witheachchildfragmeotrepresentedby alinktothe child 

fragment (e.g. a JSP that is externally requested), this could fragment which can be expanded when the parent is 

be obtained from the HTTP request object's URL. For a returned. Each link includes both the chUd's fragment ID 

contained fragment, this is the JSP file name URL. The fragment ID is needed to expand the chHd in 

Fragment 700 also contains a cache it 706 indicator, ca^ when the child fragment is in the cache. The URL 

which is metadata identifying whether the fragment should 5^ f ""^'^'f'^ '"^.^^ ''^^t^ ^ "^^.h r^'w 

, . J ^ . . 11 • J • • L . " fragment is not m the cache. The advantages 01 the Unked 

be cached. Cache it 706 allows a runlmie decision about "-6"^*^"^ ^ ^ *^ « u moiv^u 

whether a particular instance (fragment) of the JSP should be representaUon are: 

cached. For example, on a product display JSP, it may be Reduced storage because the chfld fragments are not 

known that products from a certain category are either replicated. 

accessed too infrequently or changed too frequently to be 55 The parents does not have to be invalidated when a 

cached worthwhile caching. contained child is invalidated. 

Next, pinned 708 provides an option of aUowing least The child fragment's cache access frequency statistics are 

recently used (LRU) replacement to be applied to the more accurate because the child is accessed from the cache 

fragment. Reasons for choosing the '*pinned" option might each time it is used. 

be real-time requirements or the fact that the web developer All of the above caching metadata illustrated in FIG. 7 is 

knows better than a LRU algorithm, supplied in the cached fragment's JSP. Ibis information is 

Aggressive rerender 710 is metadata that is used to supplied using a Fragmentlnfo object, which is a simple 

indicate whether to aggressively rerender a fragment when struct object. 

it is invalidated or when its time limit expires, instead of Application tailorable ids are also provided for in accor- 

waiting for an external request that needs the fragment. With 65 dance with a preferred embodiment of the present invention, 

this aggressive rerender option, either a lime limit or data ID AH of the ids are determined by the application. For 

invahdation may cause the value of fragment 700 to be set example, a cache entry for a product display page might 
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have the form of "productDisplay — TA23724", where the JSP template page granularity to be made as to whether that 
"productDisplay-" part is the same for all product display template would be pushed to an external cache versus an 
pages and the "TA23724'* is the individual product ID that internal cache. Those templates that satisfy the above con- 
is different for each prodnct. ditions could exploit the cost-performance advantage of an 

The API for including in a page another servlet, another 5 external cache. Other templates could use the flexibiUty of 

JSP or an HTML file is a Requestdispatcher.includef ) the intemal cache. In the figures desmbed below, a mecha- 

method. A JSP that uses caching would be called using this ^ implementing internal and external caches are 

same mechanism. The called JSP contains all of its own in accordjmce with a preferred embodim^^^^^ 

cachin metadata present invention. After a page has been rendered, the 
5 r « 1- - in resulting fragment is pushed to the cache coordinator. The 

With reference next to FIG. 8, a diagram lUustratmg ^^^.^e coordinator looks at the "externalCacheGroupId^' of 

statements for use in a RequestDispatcher.include() method fragment. If the cache group is a valid cache group 

IS depicted in accordance with a preferred embodiment of defined in this environment, then it will push the fragment 

the present invention. The statements in FIG. 8 represent external caches in that cacheGroup. Mapping of 

instructions that are added to a JSP that is to be cached. extemalCacheGroup ID to a Hst of aU adapters that are a part 

With reference now to FIG. 9, a diagram illustrating of that extemalCacheGroup. As elements are pushed to the 

instructions used to add a get cache method to a JSP is various external caches, some information is maintained to 

depicted in accordance with a preferred embodiment of the invalidate. This information includes, for example, mapping 

present invention. These instmctions are used to allow the of fragment data IDs to URLs and mapping of URls 

JSP to obtain a cache ID. Next, in FIG. 10, a diagram of (templates) to URLs. Both are used to invalidate a large 

instructions used in a service method is depicted in accor- number of rendered pages in the external caches depending 

dance with a preferred embodiment of the present invention. on either data IDs or then template (URI). FIGS. 12-14 

The service method is th e code that is executed in a provide illustrations of data flow used in caching dynamic 

JSP/servlet when an HTTP (do post, do get) request is sent. content for Web apphcations. The examples in FIGS. 12-14 
The service method contains the logic to "render** a page. ^ are all described with respect to a single Web application 

Turning now to FIG. 11, an example of a JSP is depicted server (WAS) Java virtual machine (JVM). Java™ is an 

in accordaDce with a preferred embodiment of the present object oriented programming language and environment 

invention. JSP 1100 includes instructions for defining meth- focusing on defining data as objects and the methods that 

ods on servlets, which can be called to obtain the ID of the may be applied to those objects. Java supports only a single 

page before the rest of the page is cached or run. "<%! %>" inheritance, meaning that each class can inherit from only 

defines a function to return the "cache ID" of this instance one other class at any given time. Java also allows for the 

given an HTTP request and response. This will be called creation of totally abstract classes known as interfaces, 

before any rendering happens. "<% %>" defines the rest of which allow the defining of methods that may be shared with 

the metadata for the fragment. When caching fragments, two several classes without regard for how other classes arc 

types of locations may be used, an external cache and an handling the methods. 

internal cache. A wide variety of caches exist that are The Java virtual machine (JVM) is a virtual computer 

external to the web application server. For example, external component that resides only in memory. The term "JVM" is 

caches may include those in a web server, operating system, a Java specific term for "address space". As used herein, 

and a sprayer. In the depicted examples, an intemal cache is JVM is interchangeable with "address space". The JVM 

a cache that is part of the web application server engine. allows Java programs to be executed on different platforms 

When a JSP template page containing requests for dynamic as opposed to only the one platform for which the code was 

content is executed, the resulting rendered fragment is compiled. Java programs are compiled for the JVM. In this 

cached. maimer Java is able to support applications for many types 

An external cache offers better cost-performance than a of data processing systems, which may contain a variety of 
web application server which supports the flexibility and 45 central processing units and operating systems architectures, 

protection required for application code. An external cache To enable a Java application to execute on different types of 

can even be supported in data processing systems that do not data processing systems, a compiler typically generates an 

have the overhead of an operating system (e.g. multiple architecture-neutral file format — the compiled code is 

processes/threads, memory management). Also, external executable on many processors, given the presence of the 

caches are closer to the client. These type of systems can Java run time system. The Java compiler generates bytecode 

have a very large cost-performance advantage. instructions that are non-specific to particular computer 

The advantages of an internal cache include fragment architectures. A bytecode is a machine independent code 

granularity that can be exploited. For example, if part of a generated by the Java compiler and executed by a Java 

page is too voladle to cache, the rest of the page can still be interpreter. 

cached. With an external cache, only whole pages, also 55 A development environment, such as the Java Develop- 

referred to as fragments can be cached. Consequently, these ment Kit (JDK) available from Sun Microsystems, Inc., may 

pages cannot contain content that is too volatile to cache. be used to build Java byte code from Java language source 

A further advantage of an internal cache is that access code and libraries, 

control can be enforced, causing a page to be accessible to Turning now to FIG. 12, a data flow diagram Ulustrating 
only a selected group. For example, the product description eo the handling of an external request from a client for content 

pages may be different for different shopper groups. With an is depicted in accordance with a preferred embodiment of 

external cache, authentication is not performed except for the present invention. In running a JSP, the JSP is compiled 

firewall enforcement, so the pages must be accessible to into a servlet and the servlet is compiled into a class file if 

everyone within that security domain, a compfled servlet is not more recent than a corresponding 

A Web application server of the present invention includes 65 JSP. 

an internal cache that can also push results of selected pages The execution of a JSP is triggered by an incoming 

to external caches. This mechanism aflows a decision at a request for a page received in JVM 1200 by JSP engine 1202 
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(step el). This request in the depicted example is a in more detail below. When an HTTP request is made, the 

RequestDispatcher.include( ) request in a HTTP format. This request flows through the external cache resource before the 

request may be received from a client or from another JSP request makes it to the Web application server. TTie external 

in JVM 1200. JSP engine 1202 then calls a getCacheld cache resource will realize it has the value the request is 
method on JSP 1204 (step e2). In this example, with the s looking for and return it to the client instead of burdening the 

HTTP request, JSP 1204 is a top level JSP, which may Web application server from rendering or retrieving it from 

contain a HTML in line or may contain request to include a the WAS cache. 

static HTML, Java code to dynamically generate HTML, or Turning next to FIG. 13, a data flow diagram showing 

request execution of another JSP using the handling of a data ID or fragment ID invalidation is depicted 
Requ6stDispatcher.mclude( ) method. lo in accordance with a preferred embodiment of the present 

JSP engine 1202 receives a response from JSP 1204 (step invention. In this example, time limit daemon 1210 main- 

e3). The JSP engine tries to determine whether the HTML tains a list of fragments and time limits for expiration. In 

content is cached from the returned response. This detenni- these examples, a change to a value in a fragment results in 

nation is made by sending a get value request to fragment an update to a time limit for the fragment. This update 
cache 1206 (step e4). The request for the value is made using ^5 includes increasing the expiration time to be equal to the 

the response returned from JSP 1204. A response is returned current time remaining plus the time limit for the fragment, 

by fragment cache 1206 to JSP engine 1202 (step e5). If In other implementations, other mechanism may be used, 

cached content is not present in a response from fragment For example, the current time remaining may be reset to the 

cache 1206, JSP 1204 is executed by calling the JSP's lime limit. 

service method (step e6). Step e6 occurs without executing ^ in FIG. 13, time limit daemon 1210 wakes periodically to 

step e4 if the response returned in step e3 is null. see if any fragments have expired. If a fragment has expired, 

In response, JSP 1204 accesses a relational database time limit daemon 1210 sends an invalidation signal to 

(RDB) 1209 to obtain dynamic content by sending a mes- cache coordinator 1208 (step tl). Upon receiving notifica- 

sage to RDB 1209 to obtain raw content (step e7). This tion that a fragment's time limit has expired, cache coordi- 
message may be, for example, an SQL query or a call to ^ nation 1208 sends a call to the caches to indicate that the 

stored procedure. RDB 1209 may be located, for example, fragment is no longer valid. In this example, cache coordi- 

at a back end server. A response is returned containing the nator 1208 sends an invalidate message to fragment cache 

raw or dynamic content (step e8). JSP 1204 will then render 1206 (step t2), which results in a response or reply being 

the dynamic content into a HTML format. Once the render- returned to cache coordinator 1208 (step t3). The message 

ing has been performed (logic in the servlet/JSP has been includes the fragment ID. In addition, cache coordinator 

executed) the fragment contains both the value (the 1208 sends an invalidate page message to external cache 

rendering) and the metadata required to maintain the frag- adapter 1212 (step t4). A response also is returned by 

ment in the cache. The resulting HTML content is sent to external cache adapter (step t5). These responses indicate the 

JSP engine 1202 by JSP 1204 (step e9). In turn, this content success or failure to invalidate the fragment. Cache coordi- 

is placed into fragment cache 1206 by JSP engine 1202 (step nator sends a reply back to time limit daemon 1210 (step t6). 

elO). Fragment cache 1206 then calls cache coordinator Additionally, a data ID or fragment ID invalidation may be 

1208 to write the fragment for handling by cache coordina- initiated by calling either fragment cache 1206 or cache 

tor 1208 (step ell). Cache coordinator 1208 functions to coordinator 1208. A JSP, such as JSP 1204, has access to its 

maintain all of the caches in synchronization. local fragment cache, fragment cache 1206, using a static 

If the fragment has a time limit, cache coordinator 1208 variable and may send a caU through JSP engine 1202 to 

wiU send an update message to time limit daemon 1210 with invalidate a fragment (step jl). The fragment cache imple- 

the new value to update the expiration time (step el2). A ments this call by forwarding the call to cache coordinator 

response is returned by time limit daemon 1210 to cache 1208 (step j2). At that point, steps t2-t5, as described above, 

coordinator 1208 (step el3). occur to invalidate the fragment. 

If the fragment is for an external cache, then the page is If data is updated by a source other than JSP 1204 within 

written to an external cache adapter 1212, which results in JVM 1200, then the use of a database trigger for invalida- 

the HTML content being sent to an external cache (step el4). lions is employed in the depicted example. In accordance 

The fragment may be assigned to a cache group. In such an with the preferred embodiment of the present invention, a 
instance, the fragment is written to each cache in that group, jq database trigger is employed by inserting a record into 

External cache adapter 1212 is a local adapter object that validation table 1300 in the database in RDB 1209. The 

encapsulates the protocol used by the external cache. In record in this example includes a time stamp column 1302 

other words, external cache adapter 1212 serves as an and a data ID column 1304. The time stamp in time stamp 

interface to the external cache or caches. A response is column 1302 wiU indicate when the trigger record was 

returned by external cache adapter 1212 to cache coordina- created in invalidation table 1300. The data ID in data ID 

tor 1208 (step el5). This response indicates whether the column 1304 represents the data that has been updated, 

write was successful. The response is then sent to fragment Data IDs can refer to whatever granularity is appropriate, 

cache 1206 (step el6), which in turn is sent to JSP engine In the above example, an update to a product record with a 

1202 (step el7). The HTML content is then returned to the primary key of "TA23434" would trigger inserting a record 

party sending the RequestDispatcher.include( ) message in the invalidation table with an data ID of "product — 

(step el8). TA23434". 

With reference again to step e5, if the HTML content is Invalidation daemon 1214 is a thread that runs periodi- 

cached, JSP 1204 does not have to be executed and the cache cally to perform a cache invalidation algorithm each time it 

content may be retumed to the calling entity requesting the wakes up. Invalidation daemon 1214's only state is the latest 
P^S^- 65 timestamp it has already processed and the length of its wake 

JA^ 1200 also includes an invalidation daemon 1214 and interval. Invalidation daemon 1214 reads records from 

a re render daemon 1216, whose functions will be described invalidation table 1300 in RDB 1209 (step il). The records 
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read from iavalidation table 1300 are those where the content is sent to JSP engine 1202 by JSP 1204 (step r6). In 

timestamp is greater than the latest timestamp it previoxisly turn, this content is placed into fragment cache 1206 by JSP 

processed. For the above example, the record with a data ID engine 1202 (step r7). Fragment cache 1206 then calls cache 

of "product — ^TA23434" would be found in response to coordinator 1208 to write the fragment for handling by 
reading invalidation table 1300 in step il. The results are s cache coordinator 1208 (step r8). Cache cooriginator 1208 

returned to invalidation daemon 1214 (step i2). functions to maintain all of the caches in synchronization. 

Invalidation daemon 1214 sends a notification to cache If the fragment has a time limit, cache coordinator 1208 

coordinator 1208 to invalidate any fragments with the speci- sends an update message to time limit daemon 1210 with the 

fied data ID (step 13), As a result, cache coordinator 1208 new value to update the expiration time (step r9). A response 

performs steps t2-t5 as described above. A response is sent is returned by time limit daemon 1210 to cache coordinator 

from cache coordinator 1208 to invalidation daemon 1214 1208 (step rlO). 

(step 14). In the depicted example, the fragment whose if the fragment is for an external cache, then the page is 

fragment ID was "productDetail— TA23434" would be written to an external cache adapter 1212, which results in 

invalidated. HTML content being sent to an external cache (step rll). 

Further, invalidation daemon 1214 wiU remember the The fragment may be assigned to a cache group. In such an 

latest timestamp in the retrieved records. Invalidation dae- instance, the fragment is written to each cache in that group, 

mon 1214 will sleep for a configured interval before awak- External cache adapter 1212 is a local adapter object that 

ing and performing the process again. encapsulates the protocol used by the external cache. A 

With respect to rerendering fragments, if a fragment's response is returned by external cache adapter 1212 to cache 

aggressive rerender option is not set and an invalidation coordinator 1208 (step rl2). This response indicates whether 

occurs, it is simply removed from the cache as soon as it is the write was successful. The response is then sent to 

no longer pinned as indicted by pinned metadata associated fragment cache 1206 (step rl3), which in tum is sent to 

with the fragment. If the aggressive rerender option is set rerender daemon 1216 (step rl4). 

and an invalidation occurs, the fragment's value is set to null The processes of the present application as described with 
and its fragment ID is put on a queue to be rerendered as respect to FIGS. 12-14 are in the context of a single JVM. 
soon as it is no longer pinned. For all fragments invalidated These processes also may be applied to multiple JVMs 
by a single invahdation notification, all of their values are set located on one or more servers. FIGS. 15-17 illustrate the 
to null prior to any being put on a rerender queue, so that software structure and data flow for dynamic content each- 
inconsistent fragments are avoided. The rerender queue is ing for web applications when multiple JVMs handle exter- 
located inside the FragmentCache. The fragment cache is nal requests. These JVMs may all be on the same node or 
responsible for adding and removing elements from the can span multiple nodes. 

queue. A configurable number of rerender threads are With reference now to FIG. 15, a data flow diagram 

present, which look in the queue to determine which frag- illustrating data flow for handling an extemal request using 

ments they can possibly render. The queue only contains multiple JVMs is depicted in accordance with a preferred 

cache IDs which have been rendered by this fragment cache embodiment of the present invention. In this example, a 

and can be rerendered by this cache. To be able to rerender coordinating JVM 1500 is present and will interact with Web 

a fragment, its request object is copied and saved in the apphcation server (WAS) JVM 1502. Coordinating JVM 

fragment cache along with the rest of the fragment's caching 1500 includes a cache coordinator 1504, a render coordina- 

metadata. The request object is saved only when the aggres- tor 1506, a fragment cache 1508, an external cache adapter 

sive rerender option is set. The request object is a sta ndard 1510, a time limit daemon 1512, and an invalidation daemon 

Servlet object, which encapsulates all the data of the HTTP 1514. WAS JVM 1502 contains a JSP engine 1516, JSP 

request made by the client. Analogous objects are present in 15I8, a fragment cache 1520, and a render daemon 1522. In 

cgi or ASP The request object contains information on the this example, although a single WAS JVM is shown for 

URL and parameters that were passed in the remote cHent purposes of describing data flow, coordinating JVM 1500 

machine as well as contents of the HTTP header (referring may handle multiple WAS JVMs in accordance with a 

page, what browser is calling, etc.). preferred embodiment of the present invention. 

Turning next to FIG. 14, data flow diagram illustrating The execution of a JSP is triggered by an incoming 

aggressive rerendering is depicted in accordance with a request for a page received in WAS JVM 1502 by JSP engine 
preferred embodiment of the present invention. Rerender 50 1516 (step el). This request in the depicted example is a 

daemon 1216 is a thread normally running at low priority. RequestDispatcher.include( ) request in a HTTP format JSP 

Rerender daemon 1216 will monitor the rerender queue for engine 1516 then sends a get value call to fragment cache 

work to perform. When items are present on the rerender 1520 (step e2). In turn, if the fragment needed for the request 

queue for processing, rerender daemon 1216 will increase its is not located within the fragment cache 1520, a read 
priority to normal, Rerender daemon 1216 sends a caU to 55 fragment call is sent to cache coordinator 1504 widiin 

fragment cache 1206 to rerender the fragment (step rl). coordinating JVM 1500 (step e3). Cache coordinator 1504 is 

Next, in response to the call to fragment cache 1206 to used as a cache coordinator for all the other JVMs. Cache 

rerender the fragment, fragment cache 1206 sends a request, coordinator 1504 is checked to see if the fragment is present 

RequestDispatcher.include( ) to JSP engine 1202 (step r2). because the fragment may have been rendered by anoflier 
In turn, JSP engine 1202 will request the content by sending go JVM (not shown) other than WAS JVM 1502. 

a service caU to JSP 1204 (step r3). Cache coordinator 1504 sends the read fragment request 

JSP 1204 will accesses a relational database (RDB) 1209 to render coordinator 1506 (step e4), which in turn sends the 

to obtain dynamic content by sending a message to RDB request to fragment cache 1508 in coordinating JVM 1500 

1209 (step r4), RDB 1209 may be located, for example, at (step e5). A response is returned by fragment cache 1508 to 
a back end server. A response is returned containing the raw 65 render coordinator 1506 (step e6). If the response does not 

or dynamic content (step r5). JSP 1204 then renders the inchide the fragment, render coordinator 1506 marks the 

dynamic content into a HTML format. The resulting HTML fragment as rendering work-in-progress. A response is then 
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sent back to cache coordinator 1504 (step e7). The response 
generated by the render coordinator will include either the 
fragment or an indication that the fragment was absent from 
cache 1508. 

In tUTQ, the response is sent to fragment cache 1520 in S 
WAS JVM 1502 (step e8). If a fragment is present in this 
response. The client cache will receive a fragment from the 
second level fragment cache if it is available at that second 
level. If the fragment is foimd, then the client cache will put 
the fragment in its cache. This response is relumed to JSP lo 
engine 1516 (step e9). If the response indicated that a 
fragment was absent, then the JSP is executed. The JSP is 
executed by calling the JSP's service method (step elO). If 
the fragment was present, the fragment is then returned to 
the caller requesting the content. More than one fragment is 
might be returned to the caller as a page or in other case, the 
fragment may make up the entire page. 

JSP 1518 accesses relational database (RDB) 1526 to 
obtain dynamic content by sending a message to RDB 1526 
(step ell). A response is returned by RDB 1526 containing ^ 
the raw or dynamic content (step el2). JSP 1518 then 
renders the dynamic content into a HTML format. The 
resulting HTML content is sent to JSP engine 1516 by JSP 
1518 (step el3). In turn, this content is placed into fragment 
cache 1520 by JSP engine 1516 (step 6l4). 

Thereafter, fragment cache 1520 then calls cache coordi- 
nator 1504 to write the fragment for handling by cache 
coordinator 1504 (step el5). The fragment is sent to cache 
coordinator 1504 to allow other JVMs lo access the frag- 
ment. Cache coordinator 1504 in tum sends the write 
fragment request to render coordinator 1506 (step el 6), 
which in turn sends the request to fragment cache 1508 (step 
el7). A response to the write fragment request is returned to 
render coordinator 1506 (step el8). If the response indicates 
that the fragment was successfully written, the fragment is 
unmarked as a rendering work-in-progress. The response is 
then passed on to cache coordinator 1504 (step el9). 

If the fragment has a time limit, cache coordinator 1504 
will send an update message to time limit daemon 1512 with 
the new value to update the expiration time (step e20). A 
response is returned by time limit daemon 1512 to cache 
coordinator 1504 (step e21). 

If the fragment is for an external cache, then the page is 
written to an external cache adapter 1510, which results in 45 
the HTML content being sent to an external cache (step e22). 
The fragment may be assigned to a cache group. In such an 
instance, the fragment is vmtten to each cache in that group. 
External cache adapter 1510 is a local adapter object that 
encapsulates the protocol used by the external cache. A 50 
response is retumed by external cache adapter 1510 to cache 
coordinator 1504 (step e23). This response indicates 
whether the write was successful. The response is then sent 
to fragment cache 1520 (step e24), which in turn is sent to 
JSP engine 1516 (step e25). The HTML content is then 55 
returned to the party sending the 
RequestDispatcher.include( ) message (step e26). 

Ttaiing now to FIG. 16, a data flow diagram illustrating 
handling a data ID or fragment ID invalidation, as well as 
handling time Limit expiration for a multi JVM system is 60 
depicted in accordance with a preferred embodiment of the 
present invention. Time limit daemon 1512 wakes periodi- 
cally to see if any fragments have expired. Hme limit 
daemon 1512 will send an invalidation signal to cache 
coordinator 1504 (step tl), 55 

Upon receiving notification that a fragment's time limit 
has expired, cache coordinator 1504 will send a call to the 
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caches to indicate that the fragment is no longer valid. In this 
example, cache coordinator 1504 sends an invalidate mes- 
sage to fragment cache 1508 (slep t2), which results in a 
response or reply being returned to cache coordinator 1504 
(step t3). Cache coordinator 1504 also sends an invalidate 
message to fragment cache 1520 in JVM 1502 (step t4). A 
reply as to the results of the request is turned to cache 
coordinator 1504 from fragment cache 1520 (step t5). In 
addition, cache coordinator 1504 sends an invalidate page 
message to external cache adapter 1510 (step t6). A response 
also is retumed by external cache adapter (step t7). These 
responses indicate the success or failure to invalidate a the 
fragment. Cache coordinator sends a reply back to time limit 
daemon 1512 (step t8). 

Additionally, a data ID or fragment ID invalidation may 
be initiated by calling either fragment cache 1520 or cache 
coordinator 1504. A JSP, such as JSP 1518, has access to its 
local fragment cache, fragment cache 1520, using a static 
variable and may send a call through JSP engine 1516 to 
invalidate a fragment (step jl). The fragment cache 1520 
implements this call by forwarding the call to cache coor- 
dinator 1504 in JVM 1500 (step j2). At that point, steps 
t2-t7, as described above, occur to invalidate the fragment. 

A database trigger for invalidations is employed in the 
depicted example. In accordance with a preferred embodi- 
ment of the present invention, a database trigger is employed 
by inserting a record into validation table 1600 in the 
database in RDB 1526. The record in this example includes 
a time stamp column 1602 and a data ID column 1604. The 
time stamp in time stamp column 1602 will indicate when 
the trigger record was created in invalidation table 1600. The 
data ID in data ID column 1604 represents the data that has 
been updated. 

As mentioned above with respect to validation in a single 
JVM, data IDs can refer to whatever granularity is appro- 
priate. 

Invalidation daemon 1514 is a thread that runs periodi- 
cally to perform a cache invalidation algorithm each time it 
wakes up. Its only state is the latest timestamp it has already 
processed and the length of its wake interval. Invalidation 
daemon 1514 reads records from invalidation table 1600 in 
RDB 1526 (step il). The records read from invalidation 
table 1600 are those where the timestamp is greater than the 
latest timestamp it previously processed. The results are 
returned to invalidation daemon 1514 (step i2). 

Invalidation daemon 1514 sends a notification to cache 
coordinator 1504 to invalidate any fragments with the speci- 
fied data ID (step i3). As a result, cache coordination 
performs steps t2-t7 as described above. A response is sent 
from cache coordinator 1504 to invalidation daemon 1514 
(step i4). In the depicted example, the fragment whose 
fragment ID was "productDetail— TA23434" would be 
invalidated. 

Further, invalidation daemon 1514 will remember the 
latest timestamp in the retrieved records. Validation daemon 
1514 will sleep for a configured interval before awaking and 
performing the process again. 

Turning now to FIG. 17, a data flow diagram flow for 
handling aggressive rcrcndering using multiple JVMs is 
depicted in accordance with a preferred embodiment of the 
present invention. When WAS JVM 1502 is idle, rerendcr 
daemon 1522 will increase its priority and identify a number 
of pages or fragments that are present in the re render key for 
processing. This identified list of fragments is sent to frag- 
ment cache 1520 (step zl). This list includes the fragment 
IDs along with the identification of WAS JVM 1502 in the 
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depicted example. Fragment cache 1520 forwards this list to 
cache coordinator 1504 (step z2), which in turn is forward 
to render coordinator 1506 (step 23). Render coordinator 
1506 processes the IDs for the fragments to determine 
whether a fragment within the list is in a rendering work- 5 
in-progress list. The work-in-progress list is maintained by 
render coordinator 1504 in these examples. Additionally, 
render coordinator 1506 determines whether the fragment is 
present in fragment cache 1508 by sending a query to 
fragment cache 1508 (step z4). The query includes the lO 
fragment ID. A response is returned by fragment cache 1508 
(step z5). 

If the fragment is absent from fragment cache 1508 and is 
not identified in the render work-in-progress list, render 
coordinator 1504 returns that fragment ID to the JVM that 15 
sent the fragment ID, WAS JVM 1502 for rerendering 
through cache coordinator 1504 (step z6). The fragment ID 
for rerenderiog is sent to fragment cache 1520 from cache 
coordinator 1504 (step 2^). 

Next, in response to the fragment ID from cache coordi- 20 
nator 1504, fragment cache 1520 sends a request, 
RequestDispatcher.include( ) to JSP engine 1516 (step z8). 
In turn, JSP engine 1516 executes JSP 1518 by calling a 
service method (step z9). JSP 1518 will access a relational 
database (RDB) 1526 to obtain dynamic content by sending 25 
a message to RDB 1526 (step zlO). A response is retumed 
containing the raw or dynamic content (step zU). JSP 1518 
then renders the dynamic content into a HTML format. This 
content becomes the value of the fragment after the servlet/ 
JSP has finished executing. The fragment metadata is cur- 
rently supplied by the JSP that is executed. The JSP engine 
provides some informatioa to the fragment to tell it when a 
page is being inserted and when new request attributes are 
added. The resulting HTML content is sent to JSP engine 
1516 by JSP 1518 (step zl2). In turn, this content is placed 
into fragment cache 1520 by JSP engine 1516 (step zl3), 35 
Fragment cache 1520 sends the fragment to then call cache 
coordinator 1504 to write the fragment for handling by 
cache coordinator 1504 in JVM 1500 (step zl4). 

The fragment is sent to render coordinator 1506 (step 
zl5), which in turn writes the fragment into fragment cache 40 
1508 (step zl6). A response as to the success of this write 
operation is retumed to render coordinator 1506 (step zl7). 
This response is relayed to cache coordinator 1504 (step 
zl8). 

If the fragment has a time limit, cache coordinator 1504 45 
sends an update message to time limit daemon 1512 with the 
new value to update the expiration time (step zl9). A 
response is returned by time limit daemon 1512 to cache 
coordinator 1504 (step z20). 

If the fragment is for an external cache, then the page is 50 
written to an external cache adapter 1510, which results in 
the fragment being sent to an external cache (step z21). The 
fragment may be assigned to a cache group. In such an 
instance, the fragment is written to each cache in that group. 
External cache adapter 1510 is a local adapter object that 
encapsulates the protocol used by the external cache. A 
response is retumed by external cache adapter 1510 to cache 
coordinator 1504 (step z22). This response indicates 
whether the write was successful. The response is then sent 
to fragment cache 1520 in WAS JVM 1502 (step z23). A 
response is retumed to JSP engine 1516 fr>r the set page call 
in step zl8 (step z25) with a response to call page being 
returned (step z26). Thereafter a response to the re render 
request is sent to rerender daemon 1522 (step z24). 

The present invention also incorporates a way to manage 
failed software components. When the cache coordinator 65 
sends a message to the internal or external caches, it uses a 
timeout when waiting for the reply. If the cache does not 
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respond within the timeout, the cache coordinator assumes 
the cache is no longer working, so it stops sending messages 
to the cache and sends a notification to a systems adminis- 
trator saying that the cache is not functioning. 

Thus, a coordinating JVM can significantly improve 
efficiency in a multi-JVM or multi-node configuration. Tliis 
efficiency is provided by adding a single coordinating JVM 
that contains a single cache coordinator, time limit daemon, 
invalidation daemon, and a render coordinator. Adding a 
rendering coordinator with its own fragment cache in the 
coordinating JVM reduces the number of renderings from 
once per WAS node to once for all WAS nodes. This includes 
both database access and rendering overhead. 

A single invalidation daemon significantly reduces the 
number of database accessed. In addition, a single invali- 
dation daemon and time limit daemon keeps the WAS JVMs 
in synch, so that a change shows up across all WAS JVMs 
at the same time. This avoids users seeing inconsistent 
results. A single invalidation daemon and time limit daemon 
reduces the number of messages for invalidations. 

Further, having the cache coordinator and time limit 
daemon in the same JVM reduces messages by piggyback- 
ing the notification of a changed fragment value from the 
WAS JVM to the limeLimitDaemon on the write fragment 
message from the WAS JVM to the CacheCoordinaton 

With this centralization, the data flows described above in 
FIGS. 15-17 for a multiple JVM system with a coordinating 
JVM flows differ from a single JV^ system as described in 
FIGS. 12-14 in the following ways: 

When rendered locally, the JSP engine puts it back in the 
local FragmentCache (steps el4 in FIG. 15 and zl6 in FIG. 
17). Hie local FragmentCache sends it to the CacheCoor- 
dinator (steps el5 in FIG. 15 and zl4 in FIG. 10), so that 
other caches can use it. For external caches, the cache 
coordinator behaves as it does in the single J\^ case and 
writes the fragment value aggressively (steps e22 in FIG. 15 
and z21 in FIG. 15). For fragment caches, the cache coor- 
dinator behaves differently than in the single JVM case and 
can provide a second -level cache. The cache coordinator can 
have its own fragment cache, which is used to increase 
performance as illustrated in FIGS. 15 and 17. Also, the 
cache coordinator can keep a table of which WAS JVMs, 
which have rendered which fragments. The cache coordi- 
nator can obtain a fragment out of the appropriate WAS JVM 
when the fragment is requested. The advantage of this 
approach is reduced memory. This feaUire is an alternative 
to keeping the fragments in the coordinating JVM*s frag- 
ment cache. 

Further, when the JSP engine tries to find the fragment in 
the local fragment cache and the fragment is absent from the 
local fragment cache, the fragment cache will try to find 
needed fragment in the cache coordinator (step e3 in FIG. 
15) because the fragment may have been rendered by 
another WAS JVM. If the fragment is not found in either the 
first or second level fragment cache, the render coordinator 
will mark the fragment as a work-in-progress and expect the 
first level cache to send it the fragment upon rendering (step 
elO in FIG. 15). A first level fragment cache is the fragment 
cache that is owned by the local application server process. 
The second level fragment cache is owned by the render 
coordinator in the multi-jvm scenario. With a coordinating 
JVM, only the single CacheCoordinator notifies the time 
Hmit daemon that a value has been updated (steps e20 in 
FIG. 15 and zl9 in FIG. 15). This effectively piggybacks this 
notification on the write fragment message (steps el5 in 
FIG. IS and zl4 in FIG. 17), taking advantage of the fact that 
all changes in fragment values goes through this coordinat- 
ing JVM. 

When a WAS JVM is idle, its rerender daemon asks its 
fragment cache to render the first N pages on the local 
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rerender queue (step zl in FIG. 17). The fragment cache 
forwards this via the cache coordinator (step z2 in FIG. 17) 
to the render coordinator to volunteer for rerendering work 
(step z3 in FIG. 17). The render coordinator processes these 
IDs in order, checking to see if the fragment is in the 
rendering wock-in-progress list or in the central fragment 
cache (step r4 in FIG. 10). If the ID is in neither, it returns 
that fragment ID for rerendering. Hie WAS JVM's fragment 
cache can assume that aU IDs prior to the returned ID in the 
list have already been rerendered, so its local rerender Queue 
can be updated accordingly. If an ID was returned, the WAS 
renders that fragment (step z8 in FIG. 17) and returns it to 
the RenderCoordinator (steps zl4/zl5 in FIG. 17) and goes 
back to sleep. 

As described above for the depicted example, the render 
coordinator assigns a WAS JVM to render a fragment in 
either of two ways. First, a WAS JVM asks for a fragment 
and it is not centrally available. This request returns null for 
the fragment value, indicating that the WAS JVM should 
render it locally. Second, an idle WAS JVM volunteers to 
rerender any of a list of fragment IDs and the RenderCoor- 
dinator returns one of these ID, indicating that the WAS 
JVM should render it locally. The render coordinator keeps 
track of any pending render jobs until they are done, so that 
the same one is not assigned to more than one WAS JVM. 
This centralized render coordination scheme does not need 
to copy across nodes the complex server state required to 
render or rerender a fragment. 

Instead, the scheme always uses a WAS JVM that already 
has the required server state that is saved from an external 
request. The render coordinator can use a lime out when 
waiting od a pending render job to be completed, so that a 
WAS JVM that has died does not cause problems for the rest. 

It is important to note that while the present invention has 
been described in the context of a fully functioning data 
processing system, those of ordinary skill in the art will 
appreciate that the processes of the present invention are 
capable of being distributed in the form of a computer 
readable medium of instructions and a variety of forms and 
that the present invention appHes equally regardless of the 
particular type of signal bearing media actually used to carry 40 
out the distribution. Examples of computer readable media 
include recordable-type media such a floppy disc, a hard 
disk drive, a RAM, and CD-ROMs and transmission-type 
media such as digital and analog communications links. 

The description of the present invention has been pre- 
sented for purposes of illustration and description, but is not 
intended to be exhaustive or limited to the invention in the 
form disclosed. Many modifications and variations will be 
apparent to those of ordinary skill in the art. For example, 
although the depicted examples involved content in the form 
of HTML pages being requested over the Web using HTTP, 
the present invention may be used in other enviroimients. 
For example, the process may be appUed to other Xyp&s of 
content, such as pages using extensible markup language 
(XML). Also, the sever processes do not have to reside on 
JVM's. Although the depicted examples are directed 
towards a JVM, the mechanism for the present invention 
may be applied to address spaces in other program language 
environments. Additionally, although the depicted examples 
show a particular sequence of writing to an internal cache 
and an external cache, the sequence of writing to internal 
and external caches may occur at any order or simulta- 
neously. The embodiment was chosen and described in order 
to best explain the principles of the invention, the practical 
application, and to enable others of ordinary skill in the art 
to understand the invention for various embodiments with 
various modifications as are suited to the particular use 
contemplated. 
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What is claimed is: 

1. A web server application for use in a data processing 
system comprising: 

an engine, wherein the engine receives and responds to 
requests for a page, wherein the page includes a plu- 
rality of fragments, wherein a first portion of the 
plurality of fragments are stored in an internal cache 
and a second portion of the plurality of fragments are 
stored in an external cache group; 

an interface to the extemal cache, wherein the external 
cache provides the first portion of the plurality of 
fragments stored in the extemal cache group to a chent 
data processing system sending a request for the page; 
and 

a cache coordinator, wherein the cache coordinator man- 
ages fragments located in the internal cache and the 
external cache, wherein the cache coordinator selec- 
tively sends the fragments to the extemal cache through 
the interface and wherein information on the fragments 
is maintained at the data processing system for use in 
managing the fragments sent to the external cache. 

2. The web server application of claim 1, wherein the 
cache coordinator synchronizes the internal cache and the 
external cache through controlling invalidation of contents 
within the internal cache and the extemal cache. 

3. The web server application of claim 1, wherein a 
plurality of fragments are stored in the internal cache and the 
external cache, 

4. The web server application of claim 3, wherein the 
cache coordinator stores an identification of the plurality of 
fragments. 

5. The web server application of claim 4, wherein the 
identification of the plurality of fragments includes a loca- 
tion of the plurality of fragments. 

6. The web server application of claim 4, wherein the 
cache coordinator manages invalidation of the plurality of 
fragments. 

7. The web server application of claim 6, wherein the 
cache coordinator manages rerendering of a fragment within 
the plurahty of fragments that has been invalidated. 

8. The web server application of claim 1, wherein the 
external cache is located within another data processing 
system. 

9. The web server application of claim 1, wherein the 
external cache is located within the data processing system. 

10. The web server application of claim 1, wherein the 
external cache includes a first external cache located in a first 
data processing system and a second extemal cache located 
in a second data processing system. 

11. The web server application of claim 1, wherein the 
web server application is executing in a Java virtual 
machine. 

12. The web server of claim 1, wherein the fragments sent 
to the external cache are selected by the cache coordinator 
to increase performance. 

13. The web server of claim 1, wherein access control is 
enforced on the internal cache and wherein the fragments 
sent to the external cadae are selected by the cache coordi- 
nator as fragments in which access control using authenti- 
cation is not required for access to the fragments. 

14. The web server of claim 1, wherein the cache coor- 
dinator sends a message to the extemal cache to invalidate 
an expired fragment within the fragments. 

15. The web server of claim 1, wherein the fragments sent 
to the external cache are all of the fragments within the 
plurality of fragments for the page. 

16. TTie web server of claim 1, wherein the external cache 
is a plurality of external caches. 
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